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1.  Introduction 


The  family  of  exponential  distributions  has  fundamental  meaning  in  reliability  theory, 
survival  analysis  and  general  in  the  area  of  life  time  distributions.  For  an  overview  and  more 
details  we  refer  to  Johnson,  Kotz  and  Balakrishnan  (1994)  and  Balakrishnan  and  Basu 
(1995).  We  consider  k  independent  exponential  populations  7Ti, . . . ,  irk  with  expectations 
$i, . . . ,  &k  which  are  unknown.  Let  there  be  a  control  value  #o-  Each  population  % i  is  called 
good  if  9i  >  9q  and  bad  otherwise.  We  study  the  problem  of  finding  all  good  populations. 
This  is  a  typical  subset  selection  problem,  see  Gupta  and  Panchapakesan  (1985).  We 
assume  that  the  9i  are  random  and  distributed  according  to  the  unknown  distribution 
G{.  Then  for  a  given  loss  function  the  best  selection  rule,  being  the  Bayes  selection  rule, 
depends  on  the  unknown  G[.  We  suppose  that  historical  data  are  available  and  can  be 
included  in  the  decision  rule.  This  is  the  empirical  Bayes  approach  due  to  Robbins  (1956). 
Empirical  Bayes  methods  have  been  applied  in  different  areas  of  statistics.  Deely  (1965) 
constructed  empirical  Bayes  subset  selection  procedures.  In  a  series  of  papers  Gupta 
and  Liang  (1988,  1994)  and  Gupta,  Liang  and  Rau  (1994a,  1994b)  have  studied  different 
selection  procedures  using  empirical  Bayes  approach. 

Assume  Y  are  the  actual  data  based  on  which  we  wish  to  make  a  decision.  Then 

k 

the  optimal  decision  do  depends  on  the  unknown  joint  distribution  G  =  n  Q  °f  £  == 

i=l 

(#i, ... ,  9k)-  The  central  idea  of  the  empirical  Bayes  approach  is  the  construction  of  a  good 
decision  rule  d*  on  the  basis  of  historical  data  Y_n-  The  quality  of  d*  is  then  characterized 
by  the  non-negative  random  regret  risk  R*  =  R(d* ,  G)  —  R(da ,  G).  The  aim  of  the  above 
mentioned  papers  dealing  with  empirical  Bayes  methods  was  to  construct  suitable  decision 
rules  d*  and  to  evaluate  the  non-random  regret  risk  ER*  .  The  main  goal  of  these  papers 
was  to  prove  the  convergence  of  ER*  to  zero  with  a  certain  rate.  Gupta  and  Liang  (1996) 
constructed  for  the  problem  of  selecting  good  exponential  populations  a  selection  rule  d* 
and  proved  ER*  =  0(n~A/2)  with  same  0  <  A  <  2. 

The  natural  question  is  whether  there  exist  other  selection  procedures  which  are  pos¬ 
sibly  better  in  asymptotic  sense.  But  if  ER*  =  0(n~1)  then  a  comparison  with  another 
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sequence  dn  of  selection  rules  would  lead  to  the  constants 

lim  nE[R(d*n,G)  -  R{dG,G)\ 

n— oo 

lim  nE[R(dn,G)  -  R(dG ,  G)] 

n— >•  oo 

which  have  to  be  calculated  and  compared.  At  least  two  reasons  are  against- this  argument. 
Even  if  the  stochastic  order  of  i2*  is  Op(^),  the  regret  risk  Ei?*  may  not  have  the  order 
0(~)  as  the  values  of  i?*  can  be  large  on  some  events  with  probability  tending  to  zero. 
As  these  events  do  not  occur  in  most  cases,  the  random  regret  risk  better  reflects  the 
situation  in  which  the  empirical  Bayes  methods  are  applied.  These  methods  behave  better 
than  indicated  by  the  order  of  the  regret  risk  EA* .  The  situation  is  comparable  with 
the  asymptotic  theory  of  parameter  estimation  where  different  types  of  estimators  are 
compared  by  the  limit  distribution  of  \/n(9n  —  9q)  and  in  general  not  by  a  direct  evaluation 
of  the  variance  of  9n.  A  second  more  technical  argument  is  that  lim  nE[R(d*,G)  — 

n— >oo 

R(dG,  G)]  can  be  calculated  only  in  very  special  situations.  So,  in  this  paper  we  study 
distributions  instead  of  expectations.  A  new  selection  rule  dn  for  the  problem  of  selecting 
a  good  exponential  distribution  is  introduced  by  a  modification  of  the  Gupta,  Liang  rule 
(1996).  We  show 

n[R(dn,G)  -  R(dG,G)} 

converges  in  distributions  to  a  linear  combination  of  independent  x2~' distributed  random 
variables  each  with  one  degree  of  freedom.  The  coefficients  in  the  linear  combination 
are  explicitly  calculated.  The  main  idea  for  the  new  selection  rule  is  the  fact  that  the 
construction  of  the  optimal  selection  rule  dG  needs  only  the  value  of  a  unique  zero  rjio  of 
some  function  Hi  which  depends  on  Gi.  The  main  part  of  this  paper  is  the  construction 
of  a  suitable  estimator  rjin  for  77^  and  the  proof  of  a  limit  theorem  for  v^(^7m  —  Vio)- 

2.  Formulation  of  the  Selection  Problem 

Consider  k  independent  exponential  populations  7Ti,...,7 r&  which  we  assumed  to 
have  the  density  functions  h(xi\9i)  =  I(xi  >  0)  exp{— |f},  i  =  l,...,fc,  where  9  = 
(0i, . . . ,  =  (0,  oo)fc  and  1(A)  is  the  indicator  function  of  the  set  A.  Given  a  standard 

value  90  >  0  we  call  a  population  itj  good  if  9j  >  9q.  Our  aim  is  to  select  all  good  popu¬ 
lations.  Therefore  the  decision  space  is  D  =  {0,  l}fe  =  {(ai, . . . ,  a*,)  :  Oie{0, 1}}  and  7 r*  is 


3 


selected  if  and  only  if  a*  =  1.  Similar  as  in  Gupta,  Liang  (1996),  Gupta,  Liang  (1994)  we 
use  the  loss  function 

k 

L(9,a)  =  ^2e(0i,ai) 


i=i 


where 


£(0j,  a,i)  =  a,i9i(9o  -  9i)I( 0  <  9i  <  90)  +  (1  -  a,i)9i(9i  -  9q)I(9q  <  9i) 


By  a  selection  rule  d  =  (aj., . .  .,afc)  we  shall  mean  a  measurable  mapping  of  the  sample 
space  y  =  (0,  oo)fc  into  the  decision  space  O.  If  we  have  a  measurement  from  each 
population  7 r,  then  the  risk  of  the  selection  rule  d  is  given  by 

R(9,d)  =E L(9,d(Y)) 

where  Y_  =  (Yi, . . . ,  Yk).  In  terms  of  densities  the  risk  R(9,  d )  is  also  given  by 

RifL  d)  =  j  L(0,d(y))h(y,9)dy 

k 

where  h(y\9)  =  f]  h(Vi\di)  and  dV  =  dVi>  ■  ■  ■  dVk- 

i=i 

Using  the  loss  function  (2.1)  we  get 


(2.1) 

where 

(2.2) 


RW,d)  =  £«(<>,, «(y<)) 


1=1 


Qii.Vi')  E<3j  (Yl,  .  .  .  ,  Y-l—  1,  7/j,  Y+l;  •  •  •  ;  ^fc) 


The  formula  (2.1)  shows  that  due  to  the  special  structure  of  the  loss  function  we  may 
restrict  ourselves  to  randomized  decisions  qi  which  depend  on  the  data  of  irz  only.  Further 


(2.3) 


Vi 


E i(9i,qi(Yi))  =  J  qi(yi)(90  -  0*)  e  dyi  +  Q  ( 9{ ) 
o 


where  Ci{9i)  =  9t(9t  —  9q)I(9.1  >  9q).  As  we  will  apply  the  Bayes  and  the  empirical  Bayes 
approach  to  the  selection  problem  we  assume  that  the  9Z  are  realizations  of  independent 
random  variables  ©j.  The  0*  are  assumed  to  take  values  in  (0, oo)  and  have  distribution 
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Using  these  relations  we  obtain 


(Ooipn(yi)  -  ipi2(yi))qi(yi)dyi  +  7* 


where  7 *  is  independent  of  the  selection  procedure.  This  shows  that  inf  R(G,  d)  is  attained 

d 

by  the  selection  rule  d°  =  (d°, . . . ,  d°)  where 


(2.7) 


dlfa)  = 


1  if  <  V* *2(1/. ) 

0  otherwise 


If  Gj  is  nondegenerate  then  ~  is  strictly  increasing.  This  means  that  the  zero  r)l()  of 
Oq^Ph  —  ipi2,  if  there  is  any,  is  uniquely  determined.  To  apply  the  selection  rule  d°  we  have 
to  know  the  77^.  But  the  ipu,ip2i  as  well  as  77*0  include  the  unknown  prior  distribution. 
Otherwise  the  unknown  are  the  expectation  of  some  functions  of  the  observable 

data  Y{.  This  is  the  key  to  apply  empirical  Bayes  methods.  Assume  we  have  data  from  the 
past  which  can  be  taken  into  consideration.  More  precisely  let  Yu, ,  Yin,  i  =  1, . . .  ,k  be 
n  i.i.d.  random  variables  with  density 

r°°  1  a 

Ufa)  =  /  j-e-^dGiidi) 

Jo  Vi 

The  relations  (2.5)  and  (2.6)  show  that 


(2.8) 


II, Jm)  =  i  +  V  -  Y.tiKY.,  >  y) 

U  t- 1 


is  an  unbiased  and  consistent  estimator  for  the  unknown  function  Hi(y)  =  9oipn(y)  — 
il>i2  (y).  Using  this  fact  Gupta  and  Liang  (1996)  introduced  an  empirical  Bayes  selection 
procedure  d*  by  setting 


(2.9) 


diniVi) 


1  if  Hin{yi)  <  0 
0  otherwise 


3.  Results 


The  Bayes  risk  of  the  empirical  Bayes  selection  procedure  d*  is 

fc  roo 

R{G,d*n )  =EV'  /  d*in{yi)[9Qil>ii(yi)  -  ipi2(yi)]dyi  +  ji 

i= i 
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The  regret  risk  of  the  selection  procedure  is  then  given  by 


R(G,  d*J  -  R(G,  eg)  =  EV  [. I(Hin(yi )  <  0)  -  I(H,(yi)  <  0 )]J?i(W)cfW 

i= 1 

Gupta  and  Liang  (1996)  studied  the  rate  of  convergence  to  zero  of  the  above  nonnegative 
difference  and  proved 

R(G,  d*n)  —  R(G,  d°)  =  0(n~%)  for  some  0  <  A  <  2. 

In  this  paper  we  will  study  the  random  part  of  the  regret  risk.  We  will  not  directly  deal 
with  the  decision  rule  d*n .  According  to  (2.7)  the  essential  part  of  the  optimal  selection  rule 
d®  is  the  zero  rjio  of  the  function  Hi(y)  =  &oipn(y)  —  ipaiy)-  The  function  ^  is  strictly 
increasing  if  Gi  is  non-degenerated.  To  guarantee  that  Hi  has  a  zero  we  use  the  following 
Assumption  A  which  was  already  introduced  in  Gupta,  Liang  (1996). 

Assumption  A:  It  holds  lim  <  9q  <  lim  ^  . 

240  Vm  (y)  u  y-foo 

If  Assumption  A  is  fulfilled  and  r}iQ  is  the  uniquely  determined  zero  of  Ht  then 


k 

where  Rn  =  Yl[Mi(rjin  —  Mi{Vio)]  is  the  random  regret  risk.  To  prove  a  limit  theorem 

i=l 

for  the  random  regret  risk  Rn  we  need  at  first  a  y/n—  consistent  estimator  rjin  for  r]i0. 
As  the  functions  Hin(yi)  are  discontinuous  the  estimator  for  the  zero  of  Hi  can  not  be 
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directly  introduced  as  a  zero  of  Hin{yi )  as  the  existence  of  such  zeroes  is  unclear.  To  get 
a  continuous  function  we  integrate  Hin  and  set 

(3.3)  Minixji)  =  [  Hin(s)ds 

Jo 

As  Hin(s)  vanishes  for  all  sufficiently  large  s  we  see  that  is  constant  for  all  suffi¬ 

ciently  large  yi.  Taking  into  account  the  continuity  of  Min  we  get  the  existence  of  at  least 
one  maximum  point,  say  rjin.  From  the  law  of  large  numbers  we  get  that  Hin(s )  — >  H,t(s) 

n—*oo 

—  P  P 

and  Min(y)  — Mi(y),  where  — y  denotes  stochastic  convergence.  From  the  inclusion 

n— >  co 

(3-4)  {|  rjin  ~  Vio  |>  e}  C  {  sup  Min{y)  >  Min(rn0)} 

\y-Vio\>£ 

one  can  see  that  in  case  of  a  unique  maximum  of  Mi  at  ryo  and  a  uniform  law  of  large 
numbers,  i.e. 

sup  |  Min{y )  -  Mi(y)  |  -A  >  0 

y  n— >oo 

the  rjin  will  converge  to  t^q.  Starting  with  Wald  (1949)  this  concept  to  prove  consistency 
of  estimators  defined  by  minimization  or  maximation  procedures  have  been  used  by  many 
authors  (see  Pfanzagl  (1969),  Liese,  Vajda  (1994),  van  der  Vaart,  Wellner  (1996)  and  the 
references  there). 

Theorem  1:  Suppose  Gi  is  nondegenerate  and  (2.4)  holds.  Suppose  Assumption  A  is  ful¬ 
filled  and  rjine  argmax  Min,  where  Min  and  Hin  are  defined  in  (3.3)  and  (2.8),  respectively. 
Then 

^  P 
Vin  t  T]io 
n-A-oo 

where  rjio  is  the  uniquely  determined  zero  of  Hfiy)  =  ^o^ii(y)  —  4’i2(y)- 

The  traditional  way  to  prove  asymptotic  normality  of  estimators  defined  by  a  mini¬ 
mization  procedure  is  to  get  an  equation  for  this  estimator  by  taking  the  derivative.  The 
next  step  is  a  linearization  of  the  obtained  equation  by  Taylor  expansion.  This  approach 
fails  in  our  situation  as  the  function  Min  is  not  differentiable.  But  the  one  sided  derivatives 
exist.  The  derivative  from  the  right  D+Min(yi )  exists  for  every  y,  >  0  and  the  derivative 
from  the  left  D~Min(yi)  exists  for  every  yi  >  0.  If  rjin  is  a  maximum  point  of  il/in  we  get 

1  . n  . 

D+Min(rjin )  =  -  yVflo  +  fin  -  Yu)I(Yu  >  rjin ) 
n  c — 4 
«=i 

=  Hinijlin)  S  0 
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and  for  rjin  >  0 


D  Min(rjin)  =  i  J^(0o  +  {Vin  -  Yu)  I  {Yu  >  rjin ) 
n  i= 1 

=  Hin  {rjin  -  0)  >  0 

Note  that 

(3.5)  D+Min(rjin )  -  D-Min(rjin)  =  =  **") 

i- 1 

If  the  Y«  have  continuous  distributions  then  with  probability  one  at  most  one  term  in  the 
sum  of  the  right  hand  side  of  (3.5)  is  nonzero.  Denote  by  Y i(X)  <  Y^2)  <  . . .  <  Y ■(„)  the 
order  statistics.  If  rjin  =  Yj(r_x)  then  for  Yj(r_x)  <  y  <  Yj(r) 

D+Mi„(y)  =  -  YjA<  +  V-  y.liHY.t  >  y) 

=  ^X>  +  V~Ym) 

l=r 

>  “  +  *V-1)  “  *i(*)) 

i~T 

>  D+Min(Yi(r-i))  >  0 

which  contradicts  the  maximum  property  of  rjin-  Hence  on  the  event  {rjin  >  0}  the  right 
hand  term  in  (3.5)  is  a.s.  zero.  Consequently 

(3.6)  Hin{rjin)I{rjin  >  0)  =  0  a.s. 

The  function  Hin{y )  is  discontinuous.  This  excludes  a  linearization  of  (3.6)  by  Taylor 
expansion.  But  for  large  n  we  have  Hin{y )  «  E Hu  =  9o'tpll{y)-'ipi2{y)-  As  #o'0u(z/)-V;t2(y) 
is  smooth  we  will  be  able  to  derive  an  asymptotic  linearization.  The  idea  of  an  asymptotic 
linearization  goes  back  to  Pollard  (1990)  and  was  systematically  used  in  Jureckova  and 
Sen  (1996)  to  deal  with  regression  models. 

Denote  by  the  variance  of  the  r.v.X  by  V{X).  Using  integration  by  parts  one  can 
show  that  (2.4)  implies  EY?  <  oo.  We  set 

(3-7)  at  =  V((9q  +  Vio  -  Yu)  I  {Yu  >  yi0)) 
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Denote  by  N(p,cr2)  the  distribution  with  expectation  p  and  variance  a2. 
Theorem  2:  Assume  the  conditions  in  Theorem  1  are  fulfilled  and  it  holds 

(3.8)  bi  =  -[0o$iOfco)  -  tiaiVio)]  >  0 


Vn{rjin  ~  Vio )  — >  m^o/bl) 

n—¥  oo 

where  denotes  convergence  in  distributions. 


Now  we  apply  Theorem  2  to  evaluate  the  random  part  Rn  in  the  regret  risk  E Rn .  It 


holds 


Rn  =  [Mi (rfco)  “  Mi(%n)\ 


Note  that  M-(rjio)  =  0  and  by  assumption  (3.8)  —M['{rjia)  =  —H^r/io)  =  b.t  >  0.  Hence 
by  Theorem  2 

k 

(3.9)  nRn  =  ^(77in  -  r7io)2[^  +  oP(  1)] 


Let  £(W)  denote  the  distribution  of  the  random  variables  X.  We  denote  by  xL  ■  --iXl 
i.i.d.  random  variables  whose  common  distribution  is  a  x2~  distribution  with  one  degree 
of  freedom.  Then  we  get  from  (3.9)  Theorem  2  and  the  Slutzky  theorem  the  following 
result. 

Theorem  3:  If  the  assumptions  in  Theorem  2  are  fulfilled  ,  din  is  defined  in  (3.1)  and 

k  poo  k  poo 

Rn  'y  ^  I  din(yi)Hi(yi)dyi  ^  (  I  d j  ( yi)Hi{yi)dyi 

t= i  Jo  i= i  Jo 

is  the  random  part  in  the  regret  risk  (3.2),  then 

(off)  Xi) 
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4.  Proofs: 


As  Gi  is  nondegenerate,  the  function  ^  continuous  and  is  strictly  increasing.  Hence 
by  Assumption  A  the  function  Hi(y)  has  a  uniquely  determined  zero  rjiQ  and  the  following 
holds 

Hi(y)  >  0  for  y  <  r?i0  and  Hi(y)  <  0  for  y  >  rji0 


Consequently 


Hi(s)ds 


has  a  uniquely  determined  maximum  of  y  =  rjiQ.  If  we  know  that  Min  converges  uniformly 
to  Mi  then  we  will  expect  that  the  maximum  points  of  Min  being  the  %n  will  tend  to  the 
minimum  point  ryo  of  Mi .  More  precisely 


(4.1) 

Also 


We  set 


{|  Vin  ~  Viol  >  e}  C  {  sup  Min(y)  >  Min(rji0)} 

y-\y—mo\>e 

C  {  sup  ( Min(y )  -  Min(rji0))  >  0} 

y-\y-Vto\>e 

sup  (Min (y)  -  Min(r]io))  < 

y-\y-T)io\>e 

sup  ( Mi(y )  -  Mi(rjio))  +  2  sup  \Min(y)  -  Mi(y)\ 
y-\y-yia\>z  0<y<oo 


(4.2)  5e=  sup  (M^ryo)  -  Mi(y)) 

y-\y-Vio\>e 

and  note  that  Se  >  0  as  Mi  is  strictly  increasing  for  y  <  rjiQ  and  strictly  decreasing  for 
y  >  rjio.  The  inclusion  (4.1)  implies 

(4-3)  {|  rjin  -  rjio\  >  e}  C  {  sup  \Min(y)  -  Mi(y)\  >  i<5e} 

0<y<oo  ^ 

which  shows  that  a  uniform  law  of  large  numbers  implies  the  consistency  of  rjin. 

Lemma  1.  Assume  Gi  is  nondegenerate  and  it  holds  (2.4)  and  Assumption  A.  If  Min 
is  defined  by  (3.3)  and  argmax  Min  then 

^  d 

Vin  ^  ViO 
n— >oo 
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Proof:  Note  that 


(4.4) 


_  rv  ^ 

Min(y)  =  /  Min(s)ds 
Jo 

=  fy  -  f>0  +  ^  -  y,)/(y*  >  s)dS 

Jo  nfel 

=  -  y^(y»i  A  y)(0o  ~  +  2  (*»*  A  y)) 


£=i 


where  Yu  Ay  =  min(Y^, y).  Consequently 


Min( 0°)  =  lim  Afc»(y) 

y->oo 

=  1  f>(<>o  -  5IW 


t= l 


and 


E  sup  |Min(y)  -  Min(oo)|  < 

T<y<cx> 


sup  ^ol(y«  Ay)  -  y*|  +  E  sup  Yie\(Yu  Ay)  -  Yu\  • 

T<y<o o  T<y<o o 


+  E  sup  i  |(K«A!/)2-y,?)l 

T<y< oo  ^ 


(4.5) 


X>*o  (y«  -  T)/(y,  >  t)  +  e  y*(y*  -  r)/(y«  >  r) 

<=i 


+  i  E(yJ  -  T2)I(Yu  >  T)] 
=  e0E {Yn  -  T)I{Yn  >  T)  +  EFa  (ya  -  T)I{Yix  >  T) 

+  l-E{Y2~T2)i{Yil>T) 


The  assumption  (2.4)  implies  EY^  <  oo  and  the  theorem  of  Lebesgue  yields  that 
(4.6)  lim  E  sup  |  Min{y )  -  Min(oo)  |=  lim  At  =  0 

oo  T<y<  co  T^°° 
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Now  we  study  Mjn(y)  for  0  <  y  <  T  where  T  >  0  is  fixed.  It  holds 


_  ny+h _ 

Min(y  +  h)  -  Min(y)  |=j  /  Min{s)ds 

Jy 


py±n  -I  “ 

<  /  -53|6>o  +  s-yi/|/(y«>s)da 

•'J'  n  e=i 

+  +  h)  +  Yu) 


1=1 


Hence 


(4.7) 


E  sup  \  Min(y  +  h)  —  Min(y)  \<  h(90  +  h)  +  hEYiT 

y,h:0<y<y+h<T 


For  any  function  /  :  [0,T]  — >  (— oo,+oo)  we  denote  by  Uf(h)  the  modul  of  continuity 
defined  by 


w/,t(A)  =  sup  |  f{y  +  h)-f(y) 

y  ,h.O<y<y+h<T 

Then  for  every  n  =  1,2,... 


sup 

0  <y<T 


Mtn(y)-Mi(y)  |< 


max 

0  <k<m 


Min{  )  —  Mi  (  ) 

m  m 


+  OJ 


Min,T 


Hence 


(4.8) 


^  771  _  J>£ 

E  sup  |  Min(y)  -  Mi{y)  |  <  J^E  |  Min{ — )  -  Mj(— )| 

0<j/<T  ^  w 
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and  by  (4.7),  (4.8) 


E  sup  |  Min (y)  -  Mi{y)  |  <  E  sup  |  Min{y)  -  Mi(y )  | 

0<y<oo  0<y<T 


+  E  sup  j  Min(y)  -  Min( co)  | 

T<y<o o 

+  E  |  Min( oo)  -  Min(co )  | 

+  sup  |Mj(y)  —  Mj(oo))  | 

T<y<  oo 


A  „  ,  ,Tfc,  .Tfc. 

<  V  E  Mi„( —  -  — 

1 '  m  ™ 


k= 0 


m 


r/fl  t  .  t  , 

H - (#o  H - )  4 - EYin 

m  mm 

T  — 

wMl>T(— )  +  At  +  E  |  Min (oo)  ~  Mj(oo) 
m 

Given  a  >  0  we  may  choose  To  such  that  by  (4.6) 

(4.9) 


.  a 
At»<  3 


Then  we  fix  m0  such  that 

To 


(4.10) 


0  In  .  To  .  Tq  .  XQ  .  Oi 

—  (do  +  — -)  +  — -  E  Yin  +  WMi>To(  — )  <  - 
toq  mo  mo  mo  o 


To 


a 


Here  we  have  used  that  Mi  is  uniformly  continuous  on  [0,  To].  By  the  law  of  large  numbers 
we  find  n0  such  that  for  every  n  >  n q 


mo 


(4.11) 


Ve  !  M)n  (— — -)  -  Mi(— )  I  +E  I  Min  (oo)  -  M<(oo)  |<  | 
^  mo  mo  3 


,  T0/c  , 


The  combination  of  (4.9),  (4.10),  (4.11)  yields 


E  sup  |  Min(y)  -  Mi(y)  |<  a 

0<y<oc 


for  every  n  >  uq.  This  yields 


P({  sup 

0<y<oo 


Z  n— >■  oo 
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which  implies  the  consistency  of  %n  in  view  of  (4.3). 

The  classical  method  to  deal  with  consistent  estimators  which  fulfill  an  equation  is  the 
linearization  of  the  equation  by  Taylor  expansion.  However  this  technique  is  not  applicable 
to  (3.6)  as  the  corresponding  functions  are  not  differentiable.  They  are  even  not  continuous. 
We  will  use  another  approach  and  derive  a  more  implicit  representation  by  a  stochastic 
process.  To  prepare  this  representation  of  %n  we  need  some  technical  lemmas.  The  first 
is  the  result  of  straight  forward  calculations. 

Lemma  2:  Let  (Ui,Vi),i  =  1  ,...,n  be  i.i.d.  random  vectors  with  E Uf  <  co,EV)2  <  oo 
and  WJi  =  EVi  =  0.  Then 

E(4  E  ^)2(4  E  * <)2  =  4?  E^x®?) 

V  k=  1  V  e=l 

(4.12)  +2j2(EUkVk)(EUeVl)  + 

k^e  U  1=  1 

<  ‘M'U  ~  ^  (E1712)(EV12)  +  -EU?  V? 

n  n 

Set 

(4.13)  Aij(s)  =  (90  +  s-  Yij)I(Yij  >  s ) 

We  fix0<«<6<co  and  consider  the  stochastic  processes  Aij(s)  in  [a,  b].  It  holds  for 
a  <  s  <  t  <  b 

(4.14)  Aij(t)  -  Aij(s)  =  ( Yij  -  9q  -  s)I(s  <  Yij  <t)  +  (t-  s)I(Yij  >  t ) 

Hence  by  (a  +  /3)2  <  2(a2  +  01) 

E [(Aij(t)  -E Aij(t))  -  ( Aij(s )  -  EA^-(s))]2  <  E(Atj (t)  -  Aij(s))2 

(4.15) 

<2[0o  +  25)2(Fi(t)-Fi(s))  +  (t-s)] 
where  F*  is  the  c.d.f.  of  Y^ ,  j  =  1, . . . ,  n. 


15 


Furthermore  for  a<s<t<u<b 

(Aij)(t)  —  Aij(s))(Aij(u)  —  Aij(t )) 

=[(Yy  -0Q-  s)I{s  <  Yij  <t)  +  (t-  s)I (Yij  >  f)]. 

[(Yij  -  6*o  -  t)I(t  <  Y^  <u)  +  (u-  t)I(Yij  >  u)] 

=(t  -  s)(Yij  -dQ-  t)I(t  <  Y^  <  u ) 

+  (t  -  s)(u  -  t)I(Yij  >  u) 

Using  again  (a:  +  (3)2  <  2 (a2  +  /32)  we  obtain  for  any  r.v  X,  Y  with  finite  fourth  moment 
E(X  -  E X)2(Y  -  E Y)2  <  4E(X2  +  (EJ£)2)(Y2  +  (EY)2) 

<  4(EX2Y2  +  (EX)2EY2  +  (EY)2EX2  +  (E X)2(EY)2)  <  16EX2Y2 

Putting 

X  =  (t  -  s)(Yij  -  d0  -  t)I(t  <  Y^  <  u ) 

Y=(t-s)(u-t)I(Yij  >u) 

we  arrive  at 

E[(Ay  ( t )  -  E Aij  (*))  -  (^-(s)  -  E Aij  (s))]2[(^y(u)  -  E Atj  («))  -  (Aij(t)  -  E A{j  (t))]2 

<  16  E(£  —  s)2[(Yij  —  do  —  t)I{t  <  Y^  <  u)  +  (u  —  t)I{Yij  >  u)}2 

<  16  (t  -  s)22E({Yij  -  do-  t)2I(t  <  Yij  <u)  +  (u-  t)2) 

(4.16)  <  16(t-s)22((60  +  26)2  +  62) 

Recall  that  Yij  has  the  density 

fi{Vi)  =  [  w-  e~^dGi{di) 

JO  “i 

We  introduce  the  continuous  nondecreasing  function  Ki(t),a  <  t  <  b,  by 

(4.17)  Ki(t)=  C  D(fi(s)  +  l)ds 

J  a 

where  D  =  32(#o  +  2b)2  +  b2  +  2  Then  by  (4.15)  and  (4.16) 

(4.18)  E[(%(£)  -  E Ai:j(t))  -  (Aij(s)  -  E^-(s))]2  <  K,(t)  -  Ki(s) 
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Ep4j(i)  -EAy(i))  -  (A,M  -*Mj3-(S))]2[(^(«)-MyW)(4i:j(t)  -ILMi))]2 

(4.19)  <  (Ki(t)  -  K,(s))2. 

Introduce  Win(t),a  <t<b,by 

1  n 

win(t )  =  5>i(f)  -  EiliiW) 

v  n  J=1 

Then  by  the  definition  of  Hin(t)  and  we  have 

(4-20)  Win(t)  =  y/n  ( Hin(t )  -  Hi(t)) 


For  fixed  a<s<t<u<b  we  apply  Lemma  2  to 

Uj  =  (Ajj(t)  —  EAjj  (£))  —  (s)  —  EAjj(s)) 

V}  =  ( Aij(u )  -  E Aij  ( u ))  -  (Ay  (i)  -  Ei4i;?-  (£)) 


to  get 


E[(Win(t)  -  Win(s)){Win(u)  -  Win(t))}2  <  3-—  1}E U2V2  +  - WJlV, ? 

n  n 

The  application  of  the  inequalities  (4.18),  (4.19)  to  right  hand  terms  yields 
(4.21)  E [(Win(t)  -  W’in(5))(W<„(tt)  -  Win(t ))]2  <  3 (Ki(u)  -  K^s))2 


We  have  also  to  deal  with  the  fourth  moments  of  the  process  Win.  To  this  end  we  use  the 
following  well  known  formula  for  the  fourth  moment  of  a  sum  of  i.i.d.  random  variables. 
Lemma  3:  Let  Zi,...,Zn  be  i.i.d.  random  variables  with  E Z\  =  0,<r2  =  E Z2  and 


/i4  =  E Zf  <  oo.  Then 


1 

H — A4- 
n 


Now  we  set 


Zij  =  ( Aij(t )  -E Aij(t))  -  (Aij(s)  -EAtj(s)) 


and  note  that  by  (4.18) 

a2  =  EZ2-  <  Ki{t)  -  Ki{s ) 
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The  representation  (4.14)  shows  that  the  r.v.  Zij  are  bounded 


1  %ij\  <  4(6  —  a  +  9q) 

Hence 

£4  <  (4(6  -  a  +  90))2a2 

and  with  d  =  (4(6  —  a  +  9q))2 

<  d(Ki(t)  -  Ki(s)) 

If  we  now  apply  Lemma  3  to  W{n  then 

(4.22)  E (Win(t)  -  Win(s))4  <  3 (Ki(t)  -  K^s))2  +  ^( K^t )  -  K^s)) 

Recall  that  the  model  of  continuity  Uf(h)  of  a  function  of  [a,  6]  is  defined  by 

Uf(h)  =  sup  |  (f(t  +  h)  -  f(t)  | 

a<t<t+h<b 

For  the  proof  of  the  next  lemma  we  refer  to  Shorack  and  Wellner  (1986),  p.  49.  Suppose 
Z{t),  0,  <  t  <  1  is  a  stochastic  process  for  which  every  path  is  continuous  from  right  and 
has  limits  from  the  left. 

Lemma  4:  Assume  there  is  a  continuous  nondecreasing  function  K  on  [0, 1]  such  that 

(4.23)  E (Z(t)  -  Z{s))2{Z(u)  -  Z(t))2  <  ( K{u )  -  K{t))2 
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In  view  of  (4.21)  the  condition  (4.23)  is  fulfilled.  Note  that  by  the  definition  of  Z  and 
inequality  (4.22) 


£  E(Z(±)-Z(i-I))4 

m  m 

fc=i 

^  /  (£>  — a),.  Tr  /  (6  —  a)(A;  —  1) .  2  d 

3 (Ki(a-\ - k)  —  Ki(a-\ - ))  4 — K{(b) 

f — 1  m  m  n 

«==  1 

£(**.(^2) +  £)*<») 

Consequently  by  (4.24) 

(4.25)  PW^)  ^  «)  £ 

Now  we  are  ready  to  prove  an  asymptotic  representation  for  the  estimator  rjin. 

Lemma  5:  Under  the  assumptions  of  Theorem  2  it  holds 

Vn  Hi(r}in )  =”  Win(r)io)  +  pn 
where  pn  0. 

n— >oo 

Proof: 

Fix  0  <  a  <  rjio  <  b  <  oo.  Then  by  the  consistency  of  rjin  we  get  P(rjine[a,b})  — >  1.  If 

Tl—rOC 

rfine[a ,  b]  then  by  (3.6)  and  (4.20) 

0  =  Hin(rjin)y/n 
=  lF)n(?7in)  4"  y/ri  Hopin') 

—  4“  ^di(jlin)  4“  ^in(^7io)) 

Introduce 

Cmn  =  sup  |l/Fjn(t)  PFin(r7i0)| 

l*-Uio|<*=a 

_  r  .  b  —  a. 

Dmn  ~  \p7in  ViO  \  —  / 

m 
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Then 


or 


I {Dm.n)[Win(T]io)  4”  y/n  Hi{rjin)  +  (W^iniVin)  Win  {Vio))}  —  0 


Win(r]io)  4~  \fn  (j]in )  —  Pin  4"  p2n 


where 


Pin  —  I  {D  mn)\Win{Pv o)  T  \fn  Hi(l 7jn)] 

P2n  =  I  {Dmn)  (Win  (j]in)  Winijli o) 

To  show  pin  — y  0  it  suffices  to  remark  that  by  the  consistency  of  r)in: 


But  by  (4.25) 


P(Dmn)  - >  0 

n— >  oo 


P(lft„l>«)<n"w,»(^)>e) 

fit 


As  Ki  is  continuous  we  see  UK;  (~~)  — y  0.  Taking  at  first  n  ~y  oo  and  then  m  — >  oo  we 

'  m  '  m-+  oo 

see  that 

■P(|  P2n  I  >  e)  — >  0 

n— >-00 

p 

Hence  pin  4-  p2n  — ^  0  which  proves  the  statement. 

n— *  oo 

Proof  of  Theorem  2:  As  #1(77*0)  =  0  and  #*(y)  =  0O  ‘•Pin (y)  ~  'Piniu) 
is  differentiable  at  y  =  77*0  we  obtain  from  the  consistency  of  rjin  and  Lemma  5 


\fp  {'Pin  Vio){HiiVio)  4“  Sn)  —  B^jn(77io)  4-  pn 

where  both  Sn  and  pn  tend  stochastically  to  zero.  By  the  central  limit  theorem  the 
distribution  of  W*n (77^0)  tends  to  iV(0,Oo)  as  n  — y  00.  The  application  of  the  Slutzky 
Lemma  yields  the  statement. 
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